ITERATIVE DECODING 



This nonprovisional application claims the benefit of U.S. provisional 
application No. 60/174,601 entitled "Map Decoding In Channels With Memory" filed on 
January 5, 2000. The Applicant of the provisional application is William Turin (Attorney 
Docket No. 105038). The above provisional application is hereby incorporated by 
reference including all references cited therein. 

BACKGROUND OF THE INVENTION 

1. Field of Invention 

This invention relates to iterative decoding of input sequences. 

2. Description of Related Art 

Maximum a posteriori (MAP) sequence decoding selects a most probable information sequence $X_1^T = (X_1, X_2, \ldots, X_T)$ that produced the received sequence $Y_1^T = (Y_1, Y_2, \ldots, Y_T)$. For transmitters and/or channels that are modeled using Hidden Markov Models (HMM), the process for obtaining the information sequence $X_1^T$ that corresponds to a maximum probability is difficult due to a large number of possible hidden states as well as a large number of possible information sequences $X_1^T$. Thus, new technology is needed to improve MAP decoding for HMMs.

SUMMARY OF THE INVENTION 

This invention provides an iterative process for maximum a posteriori (MAP) 
decoding. The iterative process uses an auxiliary function which is defined in terms of a 
complete data probability distribution. The MAP decoding is based on an expectation 
maximization (EM) algorithm which finds the maximum by iteratively maximizing the 
auxiliary function. For a special case of trellis coded modulation, the auxiliary function 
may be maximized by a combination of forward-backward and Viterbi algorithms. The 
iterative process converges monotonically and thus improves the performance of any 
decoding algorithm.

The MAP decoding decodes received inputs by minimizing a probability of error. A direct approach to achieve this minimization results in a complexity which grows exponentially with T, where T is the size of the input. The iterative process avoids this complexity by converging on the MAP solution through repeated use of the auxiliary function.

BRIEF DESCRIPTION OF THE DRAWINGS 
The invention is described in detail with reference to the following figures where like numerals reference like elements, and wherein:

Fig. 1 shows a diagram of a communication system;
Fig. 2 shows a flow chart of an exemplary iterative process;
Figs. 3-6 show state trajectories determined by the iterative process;
Fig. 7 shows an exemplary block diagram of the receiver shown in Fig. 1;
Fig. 8 shows a flowchart for an exemplary process of the iterative process for a TCM example; and
Fig. 9 shows step 1004 of the flowchart of Fig. 8 in greater detail.

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 
Fig. 1 shows an exemplary block diagram of a communication system 100. The communication system 100 includes a transmitter 102, a channel 106 and a receiver 104. The transmitter 102 receives an input information sequence $I_1^T$ (i.e., $I_1, I_2, \ldots, I_T$) of length T, for example. The input information sequence may represent any type of data including analog voice, analog video, digital image, etc. The transmitter 102 may represent a speech synthesizer, a signal modulator, etc.; the receiver 104 may represent a speech recognizer, a radio receiver, etc.; and the channel 106 may be any medium through which the information sequence $X_1^T$ (i.e., $X_1, X_2, \ldots, X_T$) is conveyed to the receiver 104. The transmitter 102 may encode the information sequence $I_1^T$ and transmit the encoded information sequence $X_1^T$ through the channel 106, and the receiver 104 receives the information sequence $Y_1^T$ (i.e., $Y_1, Y_2, \ldots, Y_T$). The problem in communications is, of course, to decode $Y_1^T$ in such a way as to retrieve $I_1^T$.

Maximum a posteriori (MAP) sequence decoding is a technique that decodes the received sequence $Y_1^T$ by minimizing a probability of error to obtain $X_1^T$ (and if a model of the transmitter 102 is included, to obtain $I_1^T$). In MAP, the goal is to choose a most probable $X_1^T$ that produces the received $Y_1^T$. The MAP estimator may be expressed by equation 1 below.

$$\hat{X}_1^T = \arg\max_{X_1^T} \Pr(X_1^T, Y_1^T) \qquad (1)$$

where Pr(·) denotes a corresponding probability or probability density function and $\hat{X}_1^T$ is an estimate of $X_1^T$. Equation 1 sets $\hat{X}_1^T$ to the $X_1^T$ that maximizes $\Pr(X_1^T, Y_1^T)$.

The $\Pr(X_1^T, Y_1^T)$ term may be obtained by modeling the channel 106 of the communication system 100 using techniques such as Hidden Markov Models (HMMs). An input-output HMM $\lambda = (S, \mathcal{X}, \mathcal{Y}, \pi, \{P(X,Y)\})$ is defined by its internal states $S = \{1, 2, \ldots, n\}$, inputs $\mathcal{X}$, outputs $\mathcal{Y}$, initial state probability vector $\pi$, and the input-output probability density matrices (PDMs) $P(X,Y)$, $X \in \mathcal{X}$, $Y \in \mathcal{Y}$. The elements of $P(X,Y)$, $p_{ij}(X,Y) = \Pr(j, X, Y \mid i)$, are conditional probability density functions (PDFs) of input X and corresponding output Y after transferring from the state i to state j. It is assumed that the state sequence $S_0^t = (S_0, S_1, \ldots, S_t)$, input sequence $X_1^t = (X_1, X_2, \ldots, X_t)$, and output sequence $Y_1^t = (Y_1, Y_2, \ldots, Y_t)$ possess the following Markovian property

$$\Pr(S_t, X_t, Y_t \mid S_0^{t-1}, X_1^{t-1}, Y_1^{t-1}) = \Pr(S_t, X_t, Y_t \mid S_{t-1}).$$

Using HMM, the PDF of the input sequence $X_1^T$ and output sequence $Y_1^T$ may be expressed by equation 2 below:

$$p_T(X_1^T, Y_1^T) = \pi \prod_{i=1}^{T} P(X_i, Y_i)\, \mathbf{1} \qquad (2)$$

where $\mathbf{1}$ is a column vector of n ones, $\pi$ is a vector of state initial probabilities, and n is a number of states in the HMM. Thus, the MAP estimator when using HMM may be expressed by equation 3 below:

$$\hat{X}_1^T = \arg\max_{X_1^T} \left[ \pi \prod_{i=1}^{T} P(X_i, Y_i)\, \mathbf{1} \right] \qquad (3)$$

The maximization required by equation 3 is a difficult problem because all possible sequences $X_1^T$ must be considered. This requirement results in a complexity that grows exponentially with T. This invention provides an iterative process to obtain the maximum without the complexity of directly achieving the maximization by evaluating equation 2 for all possible $X_1^T$, for example. In the iterative process, an auxiliary function is developed whose iterative maximization generates a sequence of estimates for $X_1^T$ approaching the maximum point of equation 2.
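The exponential growth may be illustrated by a minimal sketch of the direct approach, which evaluates equation 2 for every candidate sequence. The two-state binary model returned by `toy_model`, its parameter values, and all function names are assumptions introduced only for illustration; they are not part of the described system.

```python
import itertools
import numpy as np

def toy_model(errors=(0.01, 0.2)):
    """Hypothetical 2-state input-output HMM with binary inputs and outputs."""
    A = np.array([[0.9, 0.1], [0.2, 0.8]])          # assumed state transition matrix
    P = {x: {} for x in (0, 1)}
    for x in (0, 1):
        for y in (0, 1):
            b = np.array([1 - e if y == x else e for e in errors])
            P[x][y] = 0.5 * A * b[None, :]          # p_ij(x, y) = Pr(x) a_ij b_j(y | x)
    return np.array([0.5, 0.5]), P

def sequence_likelihood(pi, P, xs, ys):
    """Equation 2: pi * prod_t P(x_t, y_t) * 1."""
    v = np.asarray(pi, dtype=float)
    for x, y in zip(xs, ys):
        v = v @ P[x][y]
    return float(v.sum())

def brute_force_map(pi, P, ys, alphabet):
    """Equation 3 by exhaustive search over len(alphabet)**T candidate sequences."""
    candidates = itertools.product(alphabet, repeat=len(ys))
    return max(candidates, key=lambda xs: sequence_likelihood(pi, P, xs, ys))
```

With a binary alphabet the search visits 2^T sequences, so the sketch is practical only for very small T.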

The iterative process is derived based on the expectation maximization (EM) 
algorithm. Because the EM algorithm converges monotonically, the iterative process 
may improve the performance of any decoding algorithm by using its output as an initial 
sequence of the iterative decoding algorithm. In the following description, it is assumed 
that HMM parameters for the channel 106 and/or the transmitter 102 are available either 
by design or by techniques such as training. 

The auxiliary function may be defined in terms of a complete data probability distribution shown in equation 4 below.

$$\Psi(z, X_1^T, Y_1^T) = \pi_{i_0} \prod_{t=1}^{T} p_{i_{t-1} i_t}(X_t, Y_t), \qquad (4)$$

where $z = i_0^T$ is an HMM state sequence, $\pi_{i_0}$ is an initial probability for state $i_0$, and $p_{ij}(X,Y)$ are the elements of the matrix $P(X,Y)$. The MAP estimator of equation 1 can be obtained iteratively by equations 5-9 as shown below.

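$$X_{1,p+1}^T = \arg\max_{X_1^T} Q(X_1^T, X_{1,p}^T), \quad p = 0, 1, 2, \ldots \qquad (5)$$

where p is a number of iterations and $Q(X_1^T, X_{1,p}^T)$ is the auxiliary function which may be expressed as

$$Q(X_1^T, X_{1,p}^T) = \sum_{z} \Psi(z, X_{1,p}^T, Y_1^T) \log \Psi(z, X_1^T, Y_1^T). \qquad (6)$$

The auxiliary function may be expanded based on equation 4 as follows:

$$Q(X_1^T, X_{1,p}^T) = \sum_{t=1}^{T} \sum_{i=1}^{n} \sum_{j=1}^{n} \gamma_{t,ij}(X_{1,p}^T) \log p_{ij}(X_t, Y_t) + C \qquad (7)$$

where C does not depend on $X_1^T$, n is a number of states in the HMM and

$$\gamma_{t,ij}(X_{1,p}^T) = \alpha_i(X_{1,p}^{t-1}, Y_1^{t-1})\, p_{ij}(X_{t,p}, Y_t)\, \beta_j(X_{t+1,p}^T, Y_{t+1}^T) \qquad (8)$$

where $\alpha_i(X_{1,p}^{t}, Y_1^{t})$ and $\beta_j(X_{t+1,p}^T, Y_{t+1}^T)$ are the elements of the following forward and backward probability vectors

$$\alpha(X_{1,p}^{t}, Y_1^{t}) = \pi \prod_{i=1}^{t} P(X_{i,p}, Y_i), \qquad \beta(X_{t+1,p}^T, Y_{t+1}^T) = \prod_{i=t+1}^{T} P(X_{i,p}, Y_i)\, \mathbf{1}. \qquad (9)$$
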
Based on equations 5-9, the iterative process may proceed as follows. At p = 0, an initial estimate $X_{1,0}^T$ is generated. Then, $Q(X_1^T, X_{1,0}^T)$ is generated for all possible sequences $X_1^T$. From equations 7 and 8, $Q(X_1^T, X_{1,0}^T)$ may be evaluated by generating $\gamma_{t,ij}(X_{1,0}^T)$ and $\log(p_{ij}(X_t, Y_t))$ for each t, i, and j. $\gamma_{t,ij}(X_{1,0}^T)$ may be generated by using the forward-backward algorithm as shown below:

$$\alpha(X_{1,p}^{0}, Y_1^{0}) = \pi, \quad \alpha(X_{1,p}^{t}, Y_1^{t}) = \alpha(X_{1,p}^{t-1}, Y_1^{t-1})\, P(X_{t,p}, Y_t), \quad t = 1, 2, \ldots, T$$

$$\beta(X_{T+1,p}^{T}, Y_{T+1}^{T}) = \mathbf{1}, \quad \beta(X_{t,p}^{T}, Y_t^{T}) = P(X_{t,p}, Y_t)\, \beta(X_{t+1,p}^{T}, Y_{t+1}^{T}), \quad t = T-1, T-2, \ldots, 1$$

$\log(p_{ij}(X_t, Y_t))$ is generated for all possible $X_t$ for t = 1, 2, ..., T and the $X_t$s that maximize $Q(X_1^T, X_{1,0}^T)$ are selected as $X_{1,1}^T$. After $X_{1,1}^T$ is obtained, it is compared with $X_{1,0}^T$. If a measure $D(X_{1,1}^T, X_{1,0}^T)$ of difference between the sequences exceeds a compare threshold, then the above process is repeated until the difference measure $D(X_{1,p}^T, X_{1,p-1}^T)$ is within the threshold. The last $X_{1,p}^T$ for p iterations is the decoded output. The measure of difference may be an amount of mismatch information. For example, if $X_1^T$ is a sequence of symbols, then the measure may be a number of different symbols between $X_{1,p}^T$ and $X_{1,p-1}^T$ (Hamming distance); if $X_1^T$ is a sequence of real numbers, then the measure may be a Euclidean distance $D(X_{1,p}^T, X_{1,p-1}^T) = \left[\sum_{i=1}^{T} (X_{i,p} - X_{i,p-1})^2\right]^{1/2}$.
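One iteration of equations 5-9 may be sketched as follows, under stated assumptions. Because equation 7 is a sum over t, the sketch maximizes the auxiliary function one symbol at a time. The two-state binary model in `toy_model`, its strictly positive parameter values, and all function names are hypothetical and serve only as an illustration.

```python
import numpy as np

def toy_model(errors=(0.01, 0.2)):
    """Hypothetical 2-state HMM with binary inputs/outputs and positive entries."""
    A = np.array([[0.9, 0.1], [0.2, 0.8]])
    P = {x: {} for x in (0, 1)}
    for x in (0, 1):
        for y in (0, 1):
            b = np.array([1 - e if y == x else e for e in errors])
            P[x][y] = 0.5 * A * b[None, :]
    return np.array([0.5, 0.5]), P

def forward_backward(pi, P, xs, ys):
    """alpha[t] = pi prod_{i<=t} P(x_i, y_i); beta[t] = prod_{i>t} P(x_i, y_i) 1."""
    T, n = len(ys), len(pi)
    alpha, beta = np.zeros((T + 1, n)), np.ones((T + 1, n))
    alpha[0] = pi
    for t in range(1, T + 1):
        alpha[t] = alpha[t - 1] @ P[xs[t - 1]][ys[t - 1]]
    for t in range(T - 1, -1, -1):
        beta[t] = P[xs[t]][ys[t]] @ beta[t + 1]
    return alpha, beta

def em_step(pi, P, xs, ys, alphabet):
    """Return the sequence maximizing Q(., xs) of equation 7, symbol by symbol."""
    alpha, beta = forward_backward(pi, P, xs, ys)
    new_xs = []
    for t in range(1, len(ys) + 1):
        y = ys[t - 1]
        # Equation 8: gamma_{t,ij} from the prior estimate xs.
        gamma = alpha[t - 1][:, None] * P[xs[t - 1]][y] * beta[t][None, :]
        # Assumes every p_ij(x, y) > 0 so that the logarithm is finite.
        new_xs.append(max(alphabet, key=lambda x: np.sum(gamma * np.log(P[x][y]))))
    return tuple(new_xs)
```

Calling `em_step` repeatedly, each time with the previous result as `xs`, yields the sequence of estimates described above.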

Fig. 2 shows a flowchart of the above-described process. In step 1000, the receiver 104 receives the input information sequence $Y_1^T$ and goes to step 1002. In step 1002, the receiver 104 selects an initial estimate for the decoded output information sequence $X_{1,0}^T$ and goes to step 1004. In step 1004, the receiver 104 generates $\gamma_{t,ij}(X_{1,p}^T)$ where p = 0 for the first iteration and goes to step 1006. In step 1006, the receiver 104 generates all the $\log(p_{ij}(X_t, Y_t))$ values and goes to step 1008.

In step 1008, the receiver 104 selects a sequence $X_{1,p+1}^T$ that maximizes $Q(X_{1,p+1}^T, X_{1,p}^T)$ and goes to step 1010. In step 1010, the receiver 104 compares $X_{1,p}^T$ with $X_{1,p+1}^T$. If the compare result is within the compare threshold, then the receiver 104 goes to step 1012; otherwise, the receiver 104 returns to step 1004 and continues the process with the new sequence $X_{1,p+1}^T$. In step 1012, the receiver 104 outputs $X_{1,p+1}^T$ and goes to step 1014 and ends the process.
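The control flow of steps 1002-1014 may be sketched as a generic loop that repeats a re-estimation step until the difference measure is within the compare threshold. The function names, the `step` callback, and the `max_iter` safeguard are assumptions added for illustration and do not appear in the flowchart.

```python
def hamming(a, b):
    """Number of positions in which two symbol sequences differ."""
    return sum(u != v for u, v in zip(a, b))

def euclidean(a, b):
    """Euclidean distance between two real-valued sequences."""
    return sum((u - v) ** 2 for u, v in zip(a, b)) ** 0.5

def iterate_until_stable(step, x0, distance=hamming, threshold=0, max_iter=50):
    """Apply `step` until distance(new, old) <= threshold; return (estimate, iterations)."""
    x = tuple(x0)
    for p in range(max_iter):
        x_next = tuple(step(x))                 # steps 1004-1008: re-estimate
        if distance(x_next, x) <= threshold:    # step 1010: compare
            return x_next, p + 1                # step 1012: output
        x = x_next
    return x, max_iter                          # assumed safeguard against endless loops
```

Here `step` stands for any re-estimation routine that maps the current estimate to the next one; a threshold of 0 with the Hamming distance requires two successive estimates to be identical.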

The efficiency of the above-described iterative technique may be improved if the transmitted sequence is generated by modulators such as a trellis coded modulator (TCM). A TCM may be described as a finite state machine that may be defined by equations 10 and 11 shown below.

$$S_{t+1} = f_t(S_t, I_t) \qquad (10)$$
$$X_t = g_t(S_t, I_t) \qquad (11)$$

Equation 10 specifies the TCM state transitions while equation 11 specifies the transmitted information sequence based on the state and the input information sequence. For example, after receiving input $I_t$ in state $S_t$, the finite state machine transfers to state $S_{t+1}$ based on $S_t$ and $I_t$ as shown in equation 10. The actual output by the transmitter 102 is $X_t$ according to equation 11. Equation 10 may represent a convolutional encoder and equation 11 may represent a modulator. For the above example, the transmitter output information sequence $X_1^T$ may not be independent even if the input information sequence $I_1^T$ is independent.
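Equations 10 and 11 may be illustrated with a small finite state machine. The particular four-state encoder below (a two-bit shift register whose two coded bits are mapped to a symbol index 0-3) is a hypothetical choice made only for this sketch; the functions `f`, `g` and `encode` are not taken from the described system.

```python
def f(state, bit):
    """Equation 10: next state of an assumed 2-bit shift-register encoder."""
    return ((state << 1) | bit) & 0b11

def g(state, bit):
    """Equation 11: transmitted symbol index (0..3) from state and input bit."""
    s1, s0 = state >> 1, state & 1          # older and newer stored bits
    return 2 * (bit ^ s0 ^ s1) + (bit ^ s1)

def encode(bits, state=0):
    """Run the finite state machine from `state`; return (symbols, final state)."""
    symbols = []
    for bit in bits:
        symbols.append(g(state, bit))
        state = f(state, bit)
    return symbols, state
```

Each output symbol depends on the stored state as well as the current input, which is why the transmitted symbols may be dependent even when the input bits are independent.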

In equation 15, the $\log(p_{ij}(Y_t, X_t))$ term may be analyzed based on the TCM state transitions because the information actually transmitted $X_t$ is related to the source information $I_t$ by $X_t = g_t(S_t, I_t)$. This relationship between $X_t$ and $I_t$ forces many elements $p_{ij}(Y_t, X_t)$ of $P(Y_t, X_t)$ to zero since the finite state machine (equations 10 and 11) removes many possibilities that otherwise must be considered. Thus, unlike the general case discussed in relation to equations 5-9, evaluation of $p_{ij}(Y_t, X_t)$ may be divided into a portion that is channel related and another portion that is TCM related. The following discussion describes the iterative technique in detail for the TCM example.

For a TCM system with an independent and identically distributed information sequence, an input-output HMM may be described by equations 12 and 13 below.

$$P(X_t, Y_t) = \left[\, p_{S_t S_{t+1}} P_c(Y_t \mid X_t) \,\right] \qquad (12)$$

where

$$p_{S_t S_{t+1}} = \begin{cases} \Pr(I_t) & \text{if } S_{t+1} = f_t(S_t, I_t) \\ 0 & \text{otherwise} \end{cases} \qquad (13)$$

$P_c(Y_t \mid X_t)$ is the conditional PDM of receiving $Y_t$ given that $X_t$ has been transmitted for the HMM of a medium (channel) through which the information sequence is transmitted; $p_{S_t S_{t+1}}$ is the probability of the TCM transition from state $S_t$ to state $S_{t+1}$; and $\Pr(I_t)$ is the probability of an input $I_t$. Thus, equation 2 may be written as

$$p_T(I_1^T, Y_1^T) = \pi_c \prod_{t=1}^{T} p_{S_t S_{t+1}} P_c(Y_t \mid X_t)\, \mathbf{1},$$

where $\pi_c$ is a vector of the initial probabilities of the channel states, $X_t = g_t(S_t, I_t)$, and the product is taken along the state trajectory $S_{t+1} = f_t(S_t, I_t)$ for t = 1, 2, ..., T.

If all elements of the input information sequence are equally probable, then the MAP estimate may be expressed by equation 14 below.

$$\hat{I}_1^T = \arg\max_{I_1^T} \pi_c \prod_{t=1}^{T} P_c(Y_t \mid X_t)\, \mathbf{1}, \qquad (14)$$

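The auxiliary function may be expressed by equations 15-17 below corresponding to equations 7-9 above.

$$Q(I_1^T, I_{1,p}^T) = \sum_{t=1}^{T} \sum_{i=1}^{n} \sum_{j=1}^{n} \gamma_{t,ij}(I_{1,p}^T) \log(p_{ij}(Y_t \mid X_t)) + C \qquad (15)$$

where $X_t = g_t(S_t, I_t)$ and

$$\gamma_{t,ij}(I_{1,p}^T) = \alpha_i(Y_1^{t-1} \mid I_{1,p}^{t-1})\, p_{c,ij}(Y_t \mid X_{t,p})\, \beta_j(Y_{t+1}^T \mid I_{t+1,p}^T) \qquad (16)$$

$\alpha_i(Y_1^{t-1} \mid I_{1,p}^{t-1})$ and $\beta_j(Y_{t+1}^T \mid I_{t+1,p}^T)$ are the elements of the forward and backward probability vectors

$$\alpha(Y_1^t \mid I_1^t) = \pi_c \prod_{i=1}^{t} P_c(Y_i \mid X_i) \quad \text{and} \quad \beta(Y_{t+1}^T \mid I_{t+1}^T) = \prod_{i=t+1}^{T} P_c(Y_i \mid X_i)\, \mathbf{1} \qquad (17)$$
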

From equation 15, the Viterbi algorithm may be applied with the branch metric

$$m(I_t) = \sum_{i=1}^{n_c} \sum_{j=1}^{n_c} \gamma_{t,ij}(I_{1,p}^T) \log p_{c,ij}(Y_t \mid X_t), \quad t = 1, 2, \ldots, T \qquad (18)$$

to find a maximum of $Q(I_1^T, I_{1,p}^T)$ which can be interpreted as a longest path leading from the initial zero state to one of the states $S_T$ where only the encoder trellis is considered. The Viterbi algorithm may be combined with the backward portion of the forward-backward algorithm as follows.

1. Select an initial source information sequence $I_{1,0}^T = I_{1,0}, I_{2,0}, \ldots, I_{T,0}$.

2. Forward part:

a. set $\alpha(Y_1^0 \mid I_1^0) = \pi$, where $\pi$ is an initial state probability estimate; and

b. for t = 1, 2, ..., T, compute $X_{t,p} = g_t(S_t, I_{t,p})$,
$\alpha(Y_1^t \mid I_{1,p}^t) = \alpha(Y_1^{t-1} \mid I_{1,p}^{t-1})\, P_c(Y_t \mid X_{t,p})$,
where $I_{1,p}^t$ is a prior estimate of $I_1^t$.

3. Backward part:

a. set $\beta(Y_{T+1}^T \mid I_{T+1,p}^T) = \mathbf{1}$ and last state transition lengths $L(S_T)$ to 0 for all the states;

for t = T, T-1, ..., 1 compute:

b. $X_t = g_t(S_t, I_t)$,

c. $\gamma_{t,ij}(I_{1,p}^T) = \alpha_i(Y_1^{t-1} \mid I_{1,p}^{t-1})\, p_{c,ij}(Y_t \mid X_{t,p})\, \beta_j(Y_{t+1}^T \mid I_{t+1,p}^T)$,

d. $L(S_t) = \max_{I_t} \{ L[f_t(S_t, I_t)] + m(I_t) \}$. This step selects the paths with the largest lengths (the survivors).

e. $\hat{I}_t(S_t) = \arg\max_{I_t} \{ L[f_t(S_t, I_t)] + m(I_t) \}$. This step estimates $I_t$ corresponding to the state $S_t$ by selecting the $I_t$ of the survivor in step d.

f. $\beta(Y_t^T \mid I_{t,p}^T) = P_c(Y_t \mid X_{t,p})\, \beta(Y_{t+1}^T \mid I_{t+1,p}^T)$.

g. End (of "for" loop).

4. Reestimate the information sequence:
$I_{t,p+1} = \hat{I}_t(S_t)$, $S_{t+1} = f_t(S_t, I_{t,p+1})$, t = 1, 2, ..., T where $S_1 = 0$; and

5. If $I_{1,p+1}^T \neq I_{1,p}^T$, go to step 2; otherwise decode the information sequence as $I_{1,p+1}^T$.
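Steps 2-4 above may be sketched as a single function, under stated assumptions. The four-state encoder (`f`, `g`), the two-state channel in `channel_pdm` with strictly positive entries, the uniform initial channel state probabilities, and all parameter values are hypothetical and chosen only so that the sketch is self-contained.

```python
import numpy as np

def f(state, bit):
    """Equation 10 for an assumed 2-bit shift-register encoder."""
    return ((state << 1) | bit) & 0b11

def g(state, bit):
    """Equation 11: symbol index 0..3 from state and input bit."""
    s1, s0 = state >> 1, state & 1
    return 2 * (bit ^ s0 ^ s1) + (bit ^ s1)

def channel_pdm(y, x, A=np.array([[0.95, 0.05], [0.10, 0.90]]), e=(0.01, 0.30)):
    """Assumed 2-state channel PDM: P_c(y | x) = A diag(b(y | x)), 4 output symbols."""
    b = np.array([1 - ej if y == x else ej / 3 for ej in e])
    return A * b[None, :]

def tcm_iteration(I_prev, ys, pi_c=(0.5, 0.5), n_states=4, inputs=(0, 1)):
    """One pass of steps 2-4: returns the re-estimated input sequence."""
    T = len(ys)
    # Step 2 (forward part): symbols of the prior estimate, then alpha vectors.
    s, xs_prev = 0, []
    for i in I_prev:
        xs_prev.append(g(s, i))
        s = f(s, i)
    alpha = [np.asarray(pi_c, dtype=float)]
    for t in range(T):
        alpha.append(alpha[-1] @ channel_pdm(ys[t], xs_prev[t]))
    # Step 3 (backward part): beta recursion combined with survivor selection.
    beta, L = np.ones(len(pi_c)), np.zeros(n_states)
    choice = np.zeros((T, n_states), dtype=int)
    for t in range(T - 1, -1, -1):
        M = channel_pdm(ys[t], xs_prev[t])
        gamma = alpha[t][:, None] * M * beta[None, :]                 # step 3c
        L_new = np.empty(n_states)
        for s in range(n_states):
            # Branch metric of equation 18 added to the length of the next state.
            cands = [L[f(s, i)] + np.sum(gamma * np.log(channel_pdm(ys[t], g(s, i))))
                     for i in inputs]
            k = int(np.argmax(cands))
            L_new[s], choice[t, s] = cands[k], inputs[k]              # steps 3d, 3e
        L, beta = L_new, M @ beta                                     # step 3f
    # Step 4: trace the survivor forward from state zero.
    s, I_new = 0, []
    for t in range(T):
        i = int(choice[t, s])
        I_new.append(i)
        s = f(s, i)
    return I_new
```

Step 5 then amounts to calling `tcm_iteration` again with its own output until the estimate stops changing.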



Figs. 3-6 show an example of the iterative process discussed above where there are four states in the TCM and T = 5. The dots represent possible states and the arrows represent a state trajectory that corresponds to a particular information sequence. The iterative process may proceed as follows. First, an initial input information sequence $I_{1,0}^5$ is obtained. $I_{1,0}^5$ may be the output of an existing decoder or may simply be a guess.

The Viterbi algorithm together with the backward algorithm may be used to obtain a next estimate of the input information sequence $I_{1,1}^5$. This process begins with the state transitions between t = 4 and t = 5 by selecting state transitions leading to each of the states s0-s3 at t = 4 from states at t = 5 that have the largest value of the branch metric $L(S_4) = m(I_5)$ of equation 18 above. Then, the process moves to select state transitions between the states at t = 3 and t = 4 that have the largest cumulative distance $L(S_3) = L(S_4) + m(I_4)$. This process continues until t = 0 and the sequence of input information $I_1^5$ corresponding to the path connecting the states from t = 0 to t = 5 that has the longest path $L(S_0) = \sum_{t=1}^{T} m(I_t)$ is selected as the next input information sequence $I_{1,1}^5$.

For the example in Fig. 3, state transitions from the states at t = 4 to all the states at t = 5 are considered. Assuming that the $I_t$s are binary, then only two transitions can emanate from each of the states at t = 4: one transition for $I_5 = 0$ and one transition for $I_5 = 1$. Thus, Fig. 3 shows two arrows terminating on each state at t = 4 (arrows are "backwards" because the backward algorithm is used). State transitions 301 and 302 terminate at state s0; state transitions 303 and 304 terminate at state s1; state transitions 305 and 306 terminate at state s2; and state transitions 307 and 308 terminate at state s3.

The branch metric $m(I_t)$ of equation 18 represents a "distance" between the states and is used to select the state transition that corresponds to the longest path for each of the states s0-s3 at t = 4:



$$m(I_5) = \sum_{i} \sum_{j} \gamma_{5,ij}(X_{1,0}^5) \log p_{ij}(X_5, Y_5) = \sum_{i} \sum_{j} \alpha_i(X_{1,0}^4, Y_1^4)\, p_{ij}(X_{5,0}, Y_5)\, \beta_j(X_{6,0}^5, Y_6^5) \log p_{ij}(Y_5 \mid X_5), \qquad (19)$$



where $\beta_j(X_{6,0}^5, Y_6^5) = 1$, and $X_5 = g_5(S_5, I_5)$ by definition. There is an $I_5$ that corresponds to each of the state transitions 301-308. For this example, the values of $L(S_4) = m(I_5)$ corresponding to odd numbered state transitions 301-307 are greater than those for even numbered state transitions 302-308. Thus, odd numbered state transitions are "survivors." Each of them may be part of the state trajectory that has the longest path from t = 0 to t = 5. The survivors are depicted by solid arrows while the transitions with smaller lengths are depicted by dashed lines.

The state sequence determination process continues by extending the survivors to t = 3 as shown in Fig. 4 forming state transitions 309-316. The distances between state transitions for each of the states are compared based on $L(S_4) + m(I_4)$, where $m(I_4)$ is shown in equation 20 below.

$$m(I_4) = \sum_{i} \sum_{j} \gamma_{4,ij}(X_{1,0}^5) \log p_{ij}(X_4, Y_4) = \sum_{i} \sum_{j} \alpha_i(X_{1,0}^3, Y_1^3)\, p_{ij}(X_{4,0}, Y_4)\, \beta_j(X_{5,0}^5, Y_5^5) \log p_{ij}(Y_4 \mid X_4). \qquad (20)$$

For this example, the distances corresponding to the odd numbered state transitions 309-315 are longer than distances corresponding to even numbered state transitions 310-316. Thus, the paths corresponding to the odd numbered state transitions are the survivors. As shown in Fig. 4, the state transition 301 is not connected to any of the states at t = 3 and thus is eliminated even though it was a survivor. The other surviving state transitions may be connected into partial state trajectories. For example, partial state trajectories are formed by odd numbered state transitions 307-309, 303-311, 303-313 and 305-315.

The above process continues until t = 0 is reached as shown in Fig. 5 where two surviving state trajectories 320-322 are formed by the surviving state trajectories. All the state trajectories terminate at state zero for this example because, usually, encoders start at state zero. As shown in Fig. 6, the state trajectory that corresponds to the longest cumulative distance is selected and the input information sequence $I_1^5$ (via $S_{t+1} = f_t(S_t, I_t)$) that corresponds to the selected trajectory is selected as the next estimated input information sequence $I_{1,1}^5$. For this example, the state trajectory 320 is selected and the input information sequence $I_1^5$ corresponding to the state trajectory 320 is selected as $I_{1,1}^5$.

Fig. 7 shows an exemplary block diagram of the receiver 104. The receiver 104 may include a controller 202, a memory 204, a forward processor 206, a backward processor 208, a maximal length processor 210 and an input/output device 212. The above components may be coupled together via a signal bus 214. While the receiver 104 is illustrated using a bus architecture, any architecture may be suitable as is well known to one of ordinary skill in the art.

All the functions of the forward, backward and maximal length processors 206, 208 and 210 may also be performed by the controller 202 which may be either a general purpose or special purpose computer (e.g., DSP). Fig. 7 shows separate processors for illustration only. The forward, backward and maximal length processors 206, 208 and 210 may be combined and may be implemented by using ASICs, PLAs, PLDs, etc. as is well known in the art.

The forward processor 206 generates the forward probability vectors $\alpha(X_{1,p}^{t-1}, Y_1^{t-1})$ herein referred to as $\alpha_t$. For every iteration, when a new $X_{1,p}^T$ (or $I_{1,p}^T$) is generated, the forward processor 206 may generate a complete set of $\alpha_t$.

The backward processor 208 together with the maximal length processor 210 generate a new state sequence by searching for maximal length state transitions based on the branch metric $m(I_t)$. Starting with the final state transition between states corresponding to t = T - 1 and t = T, the backward processor generates $\beta_j(X_{t+1,p}^T, Y_{t+1}^T)$ (hereinafter referred to as $\beta_j$) as shown in equation 8 for each state transition.

The maximal length processor 210 generates $m(I_t)$ based on the results of the forward processor 206, the backward processor 208 and $p_{ij}(X_t, Y_t)$. After generating all the $m(I_t)$s corresponding to each of the possible state transitions, the maximal length processor 210 compares all the $L(S_t) + m(I_t)$s and selects the state transition that corresponds to the largest $L(S_t) + m(I_t)$, and the $I_t$ (via $S_{t+1} = f_t(S_t, I_t)$) that corresponds to the selected state transition is selected as the estimated input information for that t. The above process is performed for each t = 1, 2, ..., T to generate a new estimate $I_{1,p}^T$ for each iteration p.

Initially, the controller 202 places an estimate of the PDM P(X,Y) and $\pi$ in the memory 204 that corresponds to the HMM for the channel 106 and/or the transmitter 102. The PDM P(X,Y) may be obtained via well known training processes, for example.

When ready, the controller 202 receives the received input information sequence $Y_1^T$, places it in the memory 204 and selects an initial estimate of $I_{1,0}^T$ (or $X_{1,0}^T$). The controller 202 coordinates the above-described iterative process until a new estimate $I_{1,1}^T$ (or $X_{1,1}^T$) is obtained. Then, the controller 202 compares $I_{1,0}^T$ with $I_{1,1}^T$ to determine if the compare result is below the compare threshold value (e.g., matching a predetermined number of elements or symbols of the information sequence). The compare threshold may be set to 0, in which case $I_{1,0}^T$ must be identical with $I_{1,1}^T$. If an acceptable compare result is reached, $I_{1,1}^T$ is output as the decoded output. Otherwise, the controller 202 iterates the above-described process again and compares the estimated $I_{1,p}^T$ with $I_{1,p-1}^T$ until an acceptable result is reached and $I_{1,p}^T$ is output as the decoded output.

Fig. 8 shows a flowchart of the above-described process. In step 1000, the controller 202 receives $Y_1^T$ via the input/output device 212, places $Y_1^T$ in the memory 204 and goes to step 1002. In step 1002, the controller 202 selects an initial estimate for $I_{1,0}^T$ and goes to step 1004. In step 1004, the controller 202 determines a new state sequence and a next estimated $I_{1,1}^T$ ($I_{1,p}^T$, where p = 1) (via the forward, backward and maximal length processors 206, 208 and 210) and goes to step 1006. In step 1006, the controller 202 compares $I_{1,0}^T$ with $I_{1,1}^T$. If the compare result is within the predetermined threshold, then the controller 202 goes to step 1008; otherwise, the controller 202 returns to step 1004. In step 1008, the controller 202 outputs $I_{1,p}^T$ where p is the index of the last iteration and goes to step 1010 and ends the process.

Fig. 9 shows a flowchart that expands step 1004 in greater detail. In step 2000, the controller 202 instructs the forward processor 206 to generate $\alpha_t$ as shown in equation 8, and goes to step 2002. In step 2002, the controller 202 sets the parameter t = T and goes to step 2004. In step 2004, the controller 202 instructs the backward processor 208 to generate $\beta_j$ and the maximal length processor 210 to determine the next set of survivors based on equation 18 and the time t+1 survivors, and goes to step 2006.

In step 2006, the controller 202 decrements t and goes to step 2008. In step 2008, the controller 202 determines whether t is equal to 0. If t is equal to 0, the controller 202 goes to step 2010; otherwise, the controller 202 returns to step 2004. In step 2010, the controller 202 outputs the new estimated $I_{1,p}^T$ and goes to step 2012 and returns to step 1006 of Fig. 8.

A specific example of the iterative process for convolutional encoders is enclosed in the appendix.

While this invention has been described in conjunction with specific embodiments thereof, it is evident that many alternatives, modifications, and variations will be apparent to those skilled in the art. Accordingly, preferred embodiments of the invention as set forth herein are intended to be illustrative, not limiting. Various changes may be made without departing from the spirit and scope of the invention.

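For example, a channel may be modeled as $P_c(Y \mid X) = P_c B_c(Y \mid X)$ where $P_c$ is a channel state transition probability matrix and $B_c(Y \mid X)$ is a diagonal matrix of state output probabilities. For example, based on the Gilbert-Elliott model

$$B_c(X \mid X) = \begin{bmatrix} 1-b_1 & 0 \\ 0 & 1-b_2 \end{bmatrix} \quad \text{and} \quad B_c(\bar{X} \mid X) = \begin{bmatrix} b_1 & 0 \\ 0 & b_2 \end{bmatrix}$$

where $\bar{X}$ is the complement of X. For this case, $m(I_t)$ may be simplified as

$$m(I_t) = \sum_{i=1}^{n_c} \gamma_{t,i}(I_{1,p}^T)\, b_i(Y_t \mid X_t), \quad t = 1, 2, \ldots, T, \text{ and}$$

$$\gamma_{t,i}(I_{1,p}^T) = \alpha_i(Y_1^t \mid I_{1,p}^t)\, \beta_i(Y_{t+1}^T \mid I_{t+1,p}^T),$$

where $b_i(Y_t \mid X_t)$ are the elements of $B_c$.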
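The factorization $P_c(Y \mid X) = P_c B_c(Y \mid X)$ for the Gilbert-Elliott model may be illustrated with a minimal sketch for binary X and Y. The function name and the numerical values used in the usage note are assumptions made only for illustration.

```python
import numpy as np

def gilbert_elliott_pdm(A, b, y, x):
    """Return P_c(y | x) = A @ B_c(y | x) for binary x and y.

    A is the channel state transition matrix and b holds the per-state
    error probabilities (b_1, b_2); B_c is diagonal as in the text above.
    """
    b = np.asarray(b, dtype=float)
    diag = 1.0 - b if y == x else b      # correct reception vs. complement
    return np.asarray(A, dtype=float) @ np.diag(diag)
```

For instance, with an assumed A = [[0.99, 0.01], [0.1, 0.9]] and b = (0.001, 0.2), the matrices for y = x and for the complement of x sum to A, since the two diagonal matrices sum to the identity.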



